Picture for Xuefeng Li

Xuefeng Li

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Add code
Feb 02, 2026
Viaarxiv icon

daVinci-Dev: Agent-native Mid-training for Software Engineering

Add code
Jan 27, 2026
Viaarxiv icon

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Add code
Jan 06, 2026
Viaarxiv icon

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Add code
Aug 26, 2025
Figure 1 for ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Figure 2 for ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Figure 3 for ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Figure 4 for ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Viaarxiv icon

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Add code
May 26, 2025
Figure 1 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 2 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 3 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 4 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Viaarxiv icon

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Add code
Apr 21, 2025
Figure 1 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 2 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 3 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 4 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Viaarxiv icon

ToRL: Scaling Tool-Integrated RL

Add code
Mar 30, 2025
Viaarxiv icon

Multi-modal expressive personality recognition in data non-ideal audiovisual based on multi-scale feature enhancement and modal augment

Add code
Mar 08, 2025
Figure 1 for Multi-modal expressive personality recognition in data non-ideal audiovisual based on multi-scale feature enhancement and modal augment
Figure 2 for Multi-modal expressive personality recognition in data non-ideal audiovisual based on multi-scale feature enhancement and modal augment
Figure 3 for Multi-modal expressive personality recognition in data non-ideal audiovisual based on multi-scale feature enhancement and modal augment
Figure 4 for Multi-modal expressive personality recognition in data non-ideal audiovisual based on multi-scale feature enhancement and modal augment
Viaarxiv icon

LIMR: Less is More for RL Scaling

Add code
Feb 17, 2025
Viaarxiv icon

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Add code
Nov 25, 2024
Viaarxiv icon